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Phosphatase Regulation of Nucleic Acid 

Transcription 

Background 

£0001] The regulation of cellular gene expression occurs 
primarily at the level of transcription initiation by RNA 
polymerase. Regulated transcription initiation by rna 
polymerase II in higher eukaryotes involves the formation 
of a complex with general transcription factors at 
promoters . 

[0002] The largest subunit of RNA polymerase (RNAP) ii 
contains a C-terminal domain (CTD) comprised of multiple 
repeats of the consensus sequence Tyr^Ser^Pro^Thr^Ser^Pro^Ser' , 
The progression of RNAP ii through the transcription cycle 
is regulated by both the state of CTD phosphorylation and 
the specific site of phosphorylation within the consensus 
repeat (1,2). The emerging overview of this process is that 
unphosphorylated RNAP ii, designated RNAP iia, enters the 
preinitiation complex where phosphorylation of Ser 5 is 
catalyzed by TPIIH (which contains cdk7/cyclin H subunits) 
concomitant with transcript initiation (3,4). This 
generates the phosphorylated form of RNAP ii, designated 
RNAP no. ser 5 phosphorylation facilitates the recruitment 
of the 7-methyl G capping enzyme complex (5-9) . 
Phosphorylation of Ser 2, is catalyzed by the cyclin- 
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dependent kinase P-TEFb (which contains cdk9/cyclin T 
subunits) (10,11) . Although Ser 5 phosphorylation precedes 
Ser 2 phosphorylation (12) , it is not clear if the 
dephosphorylation of Ser 5 precedes Ser 2 phosphorylation. 
During transcript elongation in yeast there is extensive 
turnover of Ser 2 phosphates mediated by FCPl and Ctkl, the 
putative PTEFb homolog (13) . Finally, dephosphorylation of 
Ser 2 by the FCPl phosphatase regenerates RNAP ua thereby 
completing the cycle (14) . 

[0003] FCPl is a class C (PPM) phosphatase containing a 
BRCT domain that is required for interaction with RNAP li 
and dephosphorylation of the CTD (15,16). FCPl interacts 
with and is stimulated by RAP74, the larger subunit of 
TFIIF (16,17). Class C phosphatases are resistant to 
inhibitors that block other classes of Ser/Thr phosphatases 
and bind Mg^* or Mn^- in the binuclear metal center of the 
catalytic site (18,19). The i|»MDXDX (T/V) motif (where ^= 
hydrophobic residue) present in the FCPl homology domain 
characterizes a subfamily of class C phosphatases with both 
Asp residues being essential for activity (20) . 
t0004] synthetic lethality is observed between mutant FCPl 
and reduced levels of rnap ii in S.cerevisiae and S. pombe, 
indicating that FCPl is an essential gene (20,21) . It 
remains uncertain whether the activity of FCPl accounts for 
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the depho^phorylatlon of both Ser 5 and Ser 2 and whether 
this is the sole activity that catalyzes ctd 
dephosphorylation. Mutations in PCPI lead to increased 
Phosphorylation of Ser 2 suggesting that it functions in 
vivo in the dephosphorylation of Ser 2 (13) . yeast FCPl 
appears specific for Ser 2 phosphate when synthetic 
peptide's are used as substrate (22) . However, „«™™iian 
FCPl dephosphorylates both Ser 2 and Ser S in vitro in the 
context of native HNAP II (23). Furthermore,' modifications 
such as that catalyzed by the Essl/Pi„i peptidyl prolyl . 
isomerase ™ay alter the activity and specificity of PCPI 
(24-26) . Given the inportance of CTD phosphorylation in 
gene egression (27, , it is essential to if additional 

CTD phosphatases exist and if so. their specificity and the 
n«chanisms by which they are targeted to RHaP ll at 
discrete stages of the transcription cycle. 
10005, Although FCPl is the only reported CTD phosphatase, 
exandnation of the databases reveals additional genes that 
consist principally of a domain with homology to the CTD 
phosphatase domain of FCPl. Three closely related human 
genes encoding small proteins with CTD phosphatase domain 
homology, but laclcing a BRCT domain, have been identified, 
in the present study we show that a gene located on 
Chromosome 2 encodes a nuclear CTD phosphatase. This 
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protein preferentially dephosphorylates Ser 5 within the 
CTD of RNAP II and is stimulated by RAP74 . Expression of 
this small CTD ghosphatase (SCPl) inhibits activated 
transcription from a variety of promoter- reporter gene 
constructs whereas expression of a mutant lacking 
phosphatase activity enhances transcription. This newly 
identified small CTD phosphatase appears to play 'an 
important role in the regulation of RNAP II transcription. 

Detailed Description 
[0006] The invention relates to methods of modifying gene 
transcription by modulating the activity of small CTD 
phopsphatases (SCP's). The invention includes methods for 
identifying compounds that bind to, or interact with, one 
or more SCP proteins or the DNA/RNA encoding the SCP 
proteins and, thus, modifying the activity of an SCP 
protein on RNA polymerase II, or on other transcription 
factors essential to gene transcription. Compounds that 
bind to, or interact with, one, or more SCP proteins or the 
DNA/RNA encoding the proteins can inhibit or enhance the 
activity of the RNA polymerase II, thus, inhibiting or 
enhancing gene transcription. For example, antisense, 
nonsense or interfering (i.e., RNAi) nucleotide sequences 
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. that modulate SCP translation or transcription can effect 
the RNA polymerase Il-mediated gene transcription. 
[0007] The invention further relates to methods of 
identifying conpounds that modulate SCP activity by in 
vitro transcription. This' invention further relates to 
methods of identifying substances that modify gene 
transcription, and methods of treating disease conditions 
resulting from insufficient, increased or aberrant 
production of SCP proteins. These methods include the use 
Of substances that bind to, or interact with, the SCP 
proteins, (naturally occurring and biologically active, 
also referred to herein as wildtype SCP proteins) genes 
encoding the SCP proteins, SCP messenger RNA, or the use of 
genetically altered SCP proteins. 

[0008] compounds used in the methods described herein can 
be proteinaceous in nature, such as peptides (comprised of 
natural and non-natural amino acids) and peptide analogs 

(comprised of peptide and non-peptide components) , or can 
be non-proteinaceous in nature, such as' small organic 
molecules. The substance can also be a genetically 
engineered SCP protein with an altered amino acid sequence. 
These substances would be designed to bind to, or interact 
With the SCP protein based on the DNA or amino acid- 
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sequences of the SCP proteins described herein, or 
antibodies reactive with the SCP proteins described herein. 
[0009] For example, a substance can be identified, or 
designed, that specifically interferes with the phosphatase 
activity of one, or more, SCP proteins thereby inhibiting 
RNA polymerase II holoenzyme activity. Monoclonal or 
polyclonal antibodies (e.g, the polyclonal antibodies 
described herein) specific for one, or more, of the SCP 
proteins can also be used to prevent, or inhibit, the SCP 
proteins from participating in the initiation of gene 
transcription . 

[0010] The invention provides methods (also referred to 
herein as a "screening assays") for identifying modulators, 
i.e., candidate or test compounds or agents (e.g., 
pept{Ldes, peptidomimetics, small molecules or other drugs) 
which bind to SCP proteins, have a stimulatory or 
inhibitory effect on, for example, SCP expression or SCP 
activity, or have a stimulatory or inhibitory effect on, 
for example, the activity of an SCP target molecule. 
[0011] In one embodiment, the invention provides assays for 
screening candidate or test compounds which are target 
molecules of a SCP protein or polypeptide or biologically 
active portion thereof. In another embodiment, the 
invention provides assays for screening candidate or test 
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compounds which bind to or modulate the activity of a SCP 
protein or polypeptide or biologically active portion 
thereof. The test compounds of the present invention can 
be obtained using any of the numerous approaches in 
combinatorial library methods Jcnown in the art, including: 
biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one- 
compound- library method; and synthetic library methods 
using affinity chromatography selection. The biological 
library approach is. limited to peptide libraries, while the 
other. four approaches are applicable to peptide, non- 
peptide Oligomer or small molecule libraries of compounds 
(Lam, K.S. (1997) Anticancer Drug Des. 12:145). 
t0012] In one embodiment, an assay is a cell-based assay in 
which a cell which expresses a SCP protein or biologically 
active portion thereof is contacted with -a test compound 
and the ability of the test compound to modulate SCP 
.activity determined. Determining the ability of the test 
compound to modulate SCP activity can be accomplished by 
monitoring the bioactivity (i.e., phosphatase activity) of 
the SCP protein or biologically active portion thereof. 
The cell, for example, can be of mammalian origin. 
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10013] SCPl and SCP2 were obtained as EST clones from 
Resgen. The full-length cDNA for SCPl (261 aa, BE300370) , 
the CDNA encoding the spliced variant of SCPl (214 aa, 
AL520011, and SCP2 (AL520463) were subcloned into Ecoll- 
Xhol sites of PGEX4T-1 and pcDNA3Plag vectors by PCR. The 

D38N mutant of SCPl 261 and the corresponding mutant 
Of SCPl 214, D48E and D50N, were generated by QuikChange 
(Stratagene) . All constructs were verified by sequencing. 
GST fusions were purified by glutathione -sepharose 
Chromatography and SCPl 261 was generated by cleavage at 
the thrombin site encoded in the vector. Recombinant PCPl 
was expressed and purified as described previously (23) . 
t0014J Human recombinant casein kinase II (CKII) and mouse 
recombinant MAPK2/ERK2 were obtained from Ui>state 
Biotechnology. Human CTDK1/CTDK2 were purified as described 
by Payne and Dahmus (28) . Human TFIIH was obtained as 
described (29) . Human P-TEFb was partially purified from 
HeLa S-100 extract by chromatography on Heparin-Sepharose 
(Amersham Biosciences) , deae ISHR (Millipore) HiTrap s and • 
Phenyl -superose (both from Amersham Biosciences) . P-TEPb 
was dialyzed against 25 mM Hepes, pH 7.9, 20% glycerol, 25 
™M KCl, 0.1 rm EDTA, 1 DTT, 1 rrM PMSF. Human rec6aa,inant 
Cdc2 kinase was purchased from New England Biolabs. Rabbit 
anti-SCPl IgG was prepared by ammonium sulfate 
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fractionation and protein G-sepharose chromatography, rnap 
II antibodies (8WG16, H5 and H14) were obtained from 
Covance . 

[0015] Preparation and Purification of "p-rnap ho Isozymes 
and '^P-GST-CTDo: Calf thymus RNAP IIA was purified by the 
method of Hodo and Blatti (30) with modifications as 
described by Rang and Dahmus (31) . Specific isozymes of ^^P- 
labeled RNAP no were prepared by phosphorylation at the 
most C-terminal serine (CKII site) in the largest subunit 
of pxirified RNAP IIA with recombinant CKII and [y-"p]ATP, 
followed by CTD phosphorylation in the presence of 2 mw ATP 
with either piirified CTDK1/CTDK2, TPIIH, P-TEPb, 
recombinant MAPK2/ERK2 or recombinant Cdc2 kinase. The RNAP 
IIO isozymes were individually purified over a DE53 column 
with a step elution of 500 raM KCl (28) . Because only the 
most C-terminal serine is 'labeled with ^^'P and lies outside 
the consensus repeat, dephosphorylation by CTD phosphatase 
results in an electrophoretic mobility shift in SDS-PAGE of 
subunit no to the position of subunit Ila without the loss 
of label. Similarly, «P-labeled GST-CTDo was prepared from 
GST-CTDa by CKII followed by MAPK2/ERK2. GST-CTDo was 
purified over a glutathione -agarose column with a step 
elution of 15 mM glutathione. 
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[0016] Pto.pt.t.se Assays: PNGP reaction mixtures (200 pi) 
containing 50 rm Trls-acetate, pH 5.5, 10 ^gcl^, 0.5 ^ 
DTT, 10% glycerol, 20 xnM PNOP and recombinant proteins were 
incubated for at 30-C for I hr. Tl.e reactions were quenched 
by adding 800 pi of 0.25 N NaOH. Release of pNG was 
determined by measuring *4io. 

10017] N-terminal biotinylated CTD phosphopeptides, 
comprised of 4 tandem repeats YSPTSPS and containing 
Phosphoserine at position 2 or position 5, were synthesized 
(Alph. Di^^ostics, San Antonio, TX) . Phosphatase reaction 
mixtures (50 pi, containing 50 mM Tris-acetate, pH 5.5, lo 
MgCl., 0.5 ^ DTT, 10% glycerol, 25 pM of phosphopeptide 
and wild type or mutant SCPl were incubated for 60 mins at 
370C. The reactions were quenched by adding 0.5 ml of 
malachite green (Biomol) . Phosphate release , was measured at 

and quantifiedrelative to a phosphate standard curve. 
[0018] CTD phosphatase assays utilizing RNAP ho and GST- 
CTDo as substrate were performed as described previously • 
(32, With minor modifications. Reactions were performed in 
20 pi Of CTD phosphatase buffer (50 mM Tris-HCl, pH 7.9, 10 
n*. Mgcl., 20% glycerol, 0.025% Tween 80, o.l,^*. edta, 5 nl, 
DTT, in the presence of 20 mM KCl. Each reaction contained 
specified amounts of GST-CTDo and/or RNAP no and was 
carried out in the presence of 7 pmol RAP74. Reactions were 
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anitiatea by the addition of pcpx or SCPl and incubated at 
30-C tor 30 Minutes. Assays were terminated by the addition 
Of SX I^e^i bu«er, and RK« xi sobunits and GST-CTD were 
resolved on a S* SDS-PAGE gel. The gel ir^ges were 
developed by autoradiography and scanned by .^lecular 
Dynamics Inage Scanner Stom. Seo in the phosphor screen 
mode, oata were ^antitatively analyzed by l:«geQuant 
software . 

tool,, ris^e culture and rransfectionsH,un«n 293, COS, and 
CVl cells were gro», at 37»C in um» supple„«nted with 10» 
nom«l calf serum (BRI.) . Subconfluent cells were 
transfected in « well tissue culture dishes using Bffectene 
tQiagen, according to the manufacturer's instructions. 
Reporter and activator plasmids ,100 ng each, and Flag SCPI 
(80 ng, or its mutant were used per well. For T3 and PPARy 
transfections. 20 ng RXR plasmid was also added. The a^unt 
Of ligands used are a, follows: 100 nM T3, 1 p„ PPAR,60.a43 
(Ligand Phar,^ceutical3, . 100 nM dexamethasone. I«xi and 
E47 expression plasmids (100 ng, were cotransfected with 
100 ng of the rat insulin promoter-luciferase reporter 
construct with the indicated concentrations of SCPi and 
Phosphatase-minus SCPl expression plasmids. The total 
amounts of transfected was kept constant by the " 
addition Of en^ty vector. Cells were harvested 48 h^s after 
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transfections and cellular extracts were assayed for 
luciferase activity using Luciferase Assay System (Promega) 
according to the manufacturer's instructions. 
[00201 Ximnunofluorescence: Cells grown on coverslip were 
fixed in 2% paraformaldehyde, neutralized and blocked using 
2.5% FCS/PBS. Rabbit polyclonal IgG 6703 was used at 1:100 
dilution, followed by goat anti-rabbit IgG H+L chains 
conjugated to Alexa Fluor 488 (1:250). Mouse anti EEAl was 
used at 1:1000 followed by goat anti-mouse IgG conjugated 
to Alexa Fluor 594 (1:250). Omission of primary antibodies 
was used as negative control. The coverslips were viewed 
using the Zeiss Axiophot which is equipped with a Hamamatsu 
Orca ER f irewire camera that runs on Improvision Openlab 
3.0.9 software. 

[0021] Immunoprecipitationsz For immunoprecipitation 
experiments, 75% confluent COS7 cells from a 10 cm dish 
were harvested in lysis buffer (PBS containing i% NP40, 1 
mM DTT and protease inhibitors) . Lysates were incubated " 
with 20 pi sepharose-conjugated anti-SCPl (6703) IgG at 4«»C 
for 6 hr. Beads were washed with PBS and the complexes were 
evaluated by western blotting using specific anti-RNAP ii 
antibodies. Rabbit anti-SNXl antibody was used as control 
IgG. 
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10022, Th. .li^nt Of three hu^n protein, that are 
closely related to one another and have hor«.logy to the 
Phosphatase domain of human FCPl is sho«n in pigu^ ^, ■ 
contain the signature motif otvdxdx(T/V)to. SCPI, located 
on Chromosome 2c,3S, was Initially designated nuclear LIM 
interacting factor (NI,I-IP, in the genome entry (33, . The 
full-length 2« aa protein is encoded by v e^ons; a shorter 
NH, terminal splice version of 214 aa is present in EST 
databases. SCPI has -20% homology to human FCPl m the 
phosphatase domain while the 3 SCP proteins are >90% 
homologous m this region. SCP2/OS4 located on chromosome 
12ql3 was co-amplified with CDK4 in sarcomas (34) and 
SCP3/HYA22 located on chromosome 3,22 was part of a large 
chromosome deletion in a lung carcinoma cell line (35) . 
These represent a subset of proteins with putative CTD 
Phosphatase-like catalytic domains found in plants, yeast, 
nematodes and arthropods. The Drosophila and Anopheles 

genomes each contain a sincri^a h,-«ui, 

a singxe highly conserved SCP 

ortholog. The SCP proteins lack the BECT domain present in 
FCPl (Figure IB) . 

(0023, TO determine whether SCPI has phosphatase activity, 
the protein was e^ressed as a GST- fusion and both SCPI 2« 
and SCPI 214 were assayed using phop as substrate. As 
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reported for PCPl from S. pombe (22) utilizing PNeP as 
substrate, the pH optimum for SCPI phosphatase activity is 
near 5 (Figure 2A) . Phosphatase activity was Mg--dependent 
and resistant to the phosphatase inhibitors okadaic acid ' 
and microcystin (Figure 2B) . Ca- could not substitute for 
Mg-. Mutations of Asp96 to Glu (D96E) had little to no 
effect on phosphatase activity (data not shown) whereas 
nrutating Asp98 to Asn (D98N) in conjunction with the D96E 
nnatation completely abolished phosphatase activity (Figure 
2B) . SCPI is thus a class 2C phosphatase whose activity is 
dependent on acidic residues in the conserved DXD motif. 
SCP2 exhibited similar phosphatase activity (Figure 2B) . 
[0024] To determine whether GST-SCPl 214 has CTD 
Phosphatase activity, GST-CTDo and RNAP IlO.were utilized 
ag substrates and the activity of SCPI was compared 
directly with that of FCPl. Recombinant CTDo • (rCTDo) and 
KNAP iio utilized as substrate in these experiments were 
prepared by the phosphorylation of purified GST-CTDa or 

RNAP IIA with casein kinase II (CKir) i„ t-u^ 

xj. it,iu.i; in the presence of [y- 

"P]ATP followed by phosphorylation with MAPK2/ERK2 in the 
presence of excess unlabeled ATP. MAPK2/ERK2 was used in 
these initial experiments because it phosphorylates both' 
GST-CTDa and RNAP IIA with comparable efficiency (23) . As 
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expected, FCPl efficiently converts RNAP lio to Rnap IIa in 
a processive manner (Figure 2C, lanes 7-12) (23,32). Even 
high concentrations of PCPl did not result in measurable 
dephosphorylation of rCTDo (Figure 2C, lanes 1-6) 
consistent with the idea that a docking site on ^ n is 
required for activity (17) . GST-SCPl 214 catalyzed the 
dephosphorylation of both RNAP no and GST-CTDo with 
comparable efficiency (Figure 2D) . m contrast to FCPl, the 
SCPl catalyzed dephosphorylation of RNAP no appears non- 
processive in that a number of phosphorylated intermediates 
are visible in SDS-PAGE. SCPl is specific for 
dephosphorylation of, the consensus repeat in that the 
Phosphate at the CKII site is not removed. As shown below, 
the pattern of dephosphorylation varies depending on the 
CTD kinase used in the preparation of rnap no. Mutant SCPl 
(D96E, D98N) lacked activity oh either substrate (data not 
shown) . SCPl is thus a CTD phosphatase that acts, on both 
RNAP no and rCTDo. SCP2 exhibits comparable CTD 
phosphatase activity when RNAP liO is utilized as substrate 
(see Figure 4) . 
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lieptad repeat 

I0025J TO determine the specificity of SCPl „ith respect to 
«s ability to dephospho^rylate specific positions «ithin 
the consensus repeat, H.«^ no isozymes were prepared in 
vitro by the phosphorylation of rnap ix^ ^ 
Of known specificity. TFIIH, P-TEPb and MM.K2/ERK2 
preferentially phosphorylate Ser 5 when synthetic peptides 
serve as substrate (36,37) whereas Cdc2 kinase 
phosphorylates Ser 2 and Ser 5 (38) . Although the 
specificity appears relaxed when RHftp h serves as 
substrate, R«ap no prepared with Cdc2 kinase is clearly 
distinct from maP no generated by other CTD kinases (23) . 
Results presented in Figure 3a indicate that RHaP no, 
prepared by the phosphorylation of RHAP ha „ith distinct 
cm kinases, exhibit a differential sensitivity to 
aephosphorylatlon with SCPl. SCPI most efficiently 
dephosphorylates RHAP no generated by TPIIH and was unable 
to dephosphorylate RNAP no prepared with Cdc2 kinase. SCPl 
was also unable to dephosphorylate RNAP HO generated by 
^1 tyrosine kinase (data not shown) . The dephosphorylation 
Of RNAP no isozymes prepared with P-TEFb. MAPK2/ERK2 and 
CTOK1/CTOK2 occurred at a reduced rate relative to that of 
RHAP no prepared with TPIIH. Furthermore, while the 
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dephospho>:ylation reaction as^ear. proces.lve for ^ ixo 
prepared by tpiih. it i. clearly non-processlve for RK„ 
IIO generated by MAPK2/ERK2. In contrast PCPl shows no 
preference for RH^ „o generated by TPIIH and efficiently 
dephosphorylates Km^ no generated by Cdc2 kinase (23) . 
These results suggest SCPI differs from PCPl in substrate 
specificity. Showing relative preference for the 
dephosphorylation of Ser 5 in the heptad repeat. 
COOae, TO investigate the relative reactivity of SCPI for 
ser 2 and ser S, a synthetic 28 aa peptide containing 4 
heptad repeats phosphorylated exclusively on Ser 2 or on 
ser 5 was dephosphorylated in the presence of increasing 
a»«unts Of SCPl. ^ Shown in Pigure 3B, SCPl preferentially 
dephosphorylates the Ser 5 phosphopeptide compared to the 
Ser 2 Phosphopeptide. This substrate specificity contrasts 
to that reported for PCPl from S. „hich referentially 

dephosphorylate the Ser 2 phosphopeptide (22) . Ma™«lian 
FCPl, Within a comparable concentration range, did not act 
on either phosphopeptide (data not shown, . These results 
using synthetic phosphopeptide substrates confirm that SCPl 
preferentially dephosphorylates Ser s phosphate of the CTD 
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Effect of RAP74 m the activity of SCPI 
C0027, The RAP74 subunit of TFIIF stimulates CTD 
phosphatase activity of PCPl (17) . Furthermore, the 
domains of PCPl that bind RAP74 are required for FCPl- 
dependent viability in s. cerevisiae (15> . Therefore, it 
was Of interest to determine if KAP74 can also influence 
the activity of SCP. CTO phosphatase activity was measured ' 
at low en^e concentrations to more readily detect 
stimulatory effects of PAF74. As shown in Figure 4. RAP74 
shifted the dose response curve for SCPI catalysed 
dephosphorylation of ^ no to an approximately lo-,old 
lower concentration. The CTD phosphatase activity of the 
GST-fusion form^ of SCPl scPi 214 and SCP. were also 

enhanced by RAP74. In support of the conclusion that RAP74 
stimulates the activity of SCPs. RAP74 bound directly to 
GST-SCPl but not to GST (data not shown) . The binding and 
stimulatory effects of RAP74 suggest that TFIIF is ' 
important for opti^l CTD phosphatase activity for both 
FCPl and SCPl. 

SCPl is located in the nucleus ..sociated with map „ 
t0028, Although SCPl lac)cs an obvious nuclear localization 
sequence, it is found in the nucleus. I^unof luorescence 
microscopy using a rabbit polyclonal anti-SCPl antibody 
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demonstrated nuclear localization of endogenous SCPI in 
COS7 cells (Figure 5 B) . Co-staining with DAPI for 
nuclear identification and with the early endosomal marker 
EBAl for cellular detail confirmed the specific 
localization of SCPI in nuclei (Figure 5 panels A and B) . 
10029] Co-immunoprecipitation was used to assess the 
association of SCPI with I^p h. Sepharose-in«nobilized 
anti-SCPl igo 6703 was used to inununoisolate SCPI fron, COS7 
cells, immunoisolates were resolved by SDS-PAGE and blotted 
With anti-RNAP li antibodies. As shown in Figure 5C, RNap 
II was present in SCPI inununoprecipitates indicating that 
SCPI and I^^P II either interact directly or are in the 
same macromolecular complex. To determine whether SCPI 
preferentially associated with either Ser 2 or Ser 5 
Phosphorylated RNAP lio, lysates were prepared in the 
presence of EDTA, to inhibit phosphatase activity. SCPI 
immunoprecipitates were then blotted with monoclonal 
antibodies specific for Ser 2 phosphate (H5) and Ser 5 
Phosphate (H14) . Both forms of RNAP no were present in 
COS7 cell lysates. Ser 5 phosphate -enriched RNAP no 
appeared to be preferentially associated ^ith SCPi in 
immunoprecipitates as indicated by the ratios of co- • 

immunoprecipitated RNAP no relaH-o--:. 4.v. 

N/if J.J.U relative to the amount of rnap 

IIO contained in the extract (Figure 5C) . 
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SCPl .ff,ct. raou. 11 transcription In vi«, 
t0030, TO assess the e«ect o* SCPl on transcription in 
Vivo, the activity o. a variety of luci.erase reporter ,ene 
constructs was examined in the presence or absence of 
cotransfectea SCPl. Targeting a Oal .-^ 
SCPX fusion or the phosphatase-inactive Gal 4-SCPl mutant 
upstream of a thy^i^ine kinase pro,„ter-luciferase reporter 
<Oal 4-TK-l.uc, haa no si^ficant effect on transcriptional 
activity (Pigure . Interestingly, untethere. SCPl in the 
presence of several reporter constructs had no si^if icant 
effect on reporter gene expression whereas the inactive 
-tant resulted in a significant stimulation of expression 
fro. the EXA^uc. PGL3-I.UC and Gal4-IAl*.^uc constructs 
(Fi^e .B, . The phosphatase-ninus SCPl „„tant increased 
lucif erase activity 1.5- to 6-fold. 

t00311 In contrast, «T or phosphatase-inactive SCPl 
affected reporter gene expression from a variety of 
regulated promoters. Luciferase activity from a Gal 4-TK- 
Luc reporter that was strongly stimulated hy co-expressing 
a Gal 4-VP16 fusion protein ,30-fold stimulation, was 
strongly inhibited by SCPl <Pigure 6C, . m contrast 
Phosphatase-inactive SCPl enhanced Gal 4-Vpi6-stimulated 
activity about 2 -fold. 



20 



Attorney Docket Ho. 1S670-003P01/SD2003-061 

[0032] opposing effects of active SCPl and the inactive 
mutant SCPl were observed using a number of inducible 
promoter-reporter constructs. Ligand- activated T3 receptor 
activity on a DR.4 TRE-TK-Luc reporter gene was inhibited 
by active SCPl and enhanced by inactive SCPl (Figure 6D) . 
Similar results were obtained when the C-terminus of T3Rp 
was fused to the Gal binding domain (Gal 4-T3RPC, and 

targeted to Gal 4-tk-Luc. SCPl also inhibited 
dexamethasone-stimulated glucocorticoid receptor activity 
on a GRE-TK-Luc construct whereas mutant SCPl significantly 
enhanced activity (Figure SB) . Finally, SCPl inhibited 
ligandactivated PPARy receptor activity assayed on a PPARy 
promoter response element and mutant SCPl enhanced activity 
(Figure 6D) . A similar response pattern was observed when 
the Gal 4-DNA binding domain was fused to the C-terminus of 
PPARy and targeted to Gal 4-l^-i.uc (data not shown) . The 
same pattern of responses was observed in HEK293, COS-7 and 
CV-l cells (data not shown) . 

10033) Although there w^s variability in the extent of 
inhibition and simulation of transcription observed with 
SCPl and mutant SCPl, respectively, the overall patterns 
were similar ueing additional inducible systems such as , 
estrogen and retinoic acid receptor responsive promoters. 
The extent of inhibition and stimulation likely results 
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iron, variable expression levels transfected proteins as 

well as potentially different rate lifting steps In 

transcription from specific promoters. 

10034] The competing effects of srpi --j 

3 •.•.^^^a or scpi and mutant SCPl were 

further examined using a rat Insulin promoter-luclferase 
construct. There Is strong synergy on this promoter between 
the bHia protein B47 and the LIM-homeodomaln protein LMxi 
which bind to adjacent Dm target sites (39,40). Co- 
expresslon of l^i and E47 enhanced luclfsrase activity 25- 
fold from the rat insulin 1 promoter (Pigure 6B, . When 
Phosphatase-lnactlve SCPl was held constant, increasing 
amounts of SCPl Inhibited luciferase expression (Figure 
sm . in the presence of mutant SCPl, the SCPl inhibition 
curve was right-shifted (-20% inhibition with 40 ng SCPl 
Plasmld plus 20 ng mutant SCPl plasmld vs. >9o» inhibition 
With 40 ng SCPl plasmld alone (Plgure 6E, , . „lth a constant 
input Of SCPl Plasmld, Increasing amounts of mutant SCPl 
not only blocked the inhibitory effects of SCPl, but 
enhanced activity significantly. The stimulatory effects of 
transfected phosphatase-lnactlve SCPl are thus consistent 
with it acting as a dominant negative. 

C003SJ Acquisition Of the CTD of RKAP 11 allows extensive 
protein interactions and is thought to have been an 
important step in the evolution of complex patterns of 
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regulated gene e=^.esslo„ (41) . ,tost l^rtantly, the CTD 
can exist in multiple conformations thereby facilitating 
the recruitment of different multiproteln complexes at 
specific points in the transcription cycle. Both 
the Site of phosphorylation within the consensus repeat, 
sor 2 or ser 5, and the extent of phosphorylation of the 
CTD control many aspects of rhap ii transcription, 
including the recruitment of ^ n to the preinitiation 
c<,mplex, initiation, capping, elongation, splicing and • 
polyadenylation ,2V, . Ser = and s.r 5 within the consensus 
CTD repeat are essential residues and distinct CTO .cinases 
catalyze phosphorylation at these sites (42) . To date, a 
single CTD phosphatase. FCPl. has been implicated in 
removing phosphates (for review see (43)). The present 
Btudy establishes that a related family of proteins, SCPs 
have CTD Phosphatase activity, are etiolated by RAP74 and 
modulate gene expression in vivo. 

ro036, SCPX preferentially catalyzes dephosphorylation of 
"KAP no phosphorylated by TPIIH. RHAP no phosphorylated 
by P-TEPb and M.PK2/EHK2 are also dephosphorylated by SCPI 
but at a reduced rate, rhap no phosphorylated by Cdc2 
Wnase. which preferentially phosphorylates Ser 2 with some 
Phosphorylation at Ser 5, is not a substrate for SCPl The 
preferential dephosphorylation of Ser 5 by SCPl was' 
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confirmed using a 4 heptad repeat peptide substrate, 
interestingly, the specificity of SCPl contrasts with that 
Of S. pomt. FCPl Which prefers Ser 2. with a similar peptide 
substrate (22) . However, PCPl dephosphorylates Ser 2 
Phosphates and Ser 5 phosphates with comparable efficiency 
when native RNAp ho serves as substrate in vitro (23). The 
relative specificity of PCPl also differs from SCPI in that 
unlike FCPl, SCPl shows preference for the 
dephosphorylation of TFIIH phosphorylated RNAP no. 
[0037] It is clear from the results presented in Figure 2C 
and D that, when RN^ no phosphorylated with MAPK2/ERK2 is 
utilized as substrate, the specific activity of SCPl is 
substantially lower than that of FCPl. This difference in 
specific activity is in part due to the fact that 
MAPK2/ERK2 phosphorylated RNAP no is an especially poor 
substrate for SCPl (Figure 3) whereas FCPl dephosphorylates 
different isozymes of RNAP no with comparable efficiency 
(23) . The amount of SCPl required to dephosphorylate 
MAPK2/ERK2 phosphorylated RNAP no is 50 to 100 fold higher 
than that required to dephosphorylate TFnH phosphorylated 
RNAP no. Furthermore, the BRCT domain in PCPl that 
facilitates its interaction with RNAP n is absent from 
SCPl. Most importantly, without a better understanding of 
the factors that influence the recruitment of SCPI to 
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complexes containing RNAP lio, it is difficult to interpret 
differences in specific activity in a binary reaction. The 
finding that SCPl dephosphorylates rCTDo and RNAP no with 
comparable efficiency whereas PCPl does not dephosphorylate 
rCTDo even at concentrations 100 times higher than that 
required to dephosphorylate RNAP no, suggest that the 
interaction of SCPl and PCPl with the CTD require different 
molecular interactions . 

10038] During the transcription cycle, protein complexes 
assemble and disassemble on the CTD in a dynamic and 
regulated manner. Ser 5 phosphorylation is detected 
primarily at the promoter region, whereas Ser 2 
phosphorylation is seen in coding regions (12) . 
Phosphorylation of Ser 5 facilitates recruitment of the 
capping enzymes and allosterically activates their activity 
(5-9,44,45) . Given the preference of SCPI for phosphoserine 
5, SCPs are candidates for acting early in the 
transcription cycle, perhaps facilitating the transition 
from initiation and capping to promoter clearance and 
processive transcript elongation. 

t0039] Like PCPl, SCPl phosphatase activity is found in a 
complex With RNAP li and its activity is stimulated by 
RAP74. Mapping studies indicate that the C-terminal domain 
of PCPl distal to the phosphatase domain interacts with 
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TPIIF (15,16). A region near the C-terminus of SCPI shares 
homology with the putative RAP74 interaction domain in CPl. 
Additional studies are necessary to establish if this 
region of SCPI mediates its interaction with RAP74. These 
observations suggest that RAP74 plays an important role in 
regulating phosphate turnover at both Ser 2 and Ser 5 
consistent with the involvement of TFIIF in assembling RNAP 
II into preinitiation complexes and in stimulating the rate 
of transcript elongation. 

£00401 The results of reporter gene assays indicate that 
over-expression of either WT or mutant SCPI can influence 
gene es^ression. The. overescpression of mutant SCPI 
activates transcription several fold from nearly all 
promoters examined (Figure 6B, C and D) whereas 
overexpression of WT SCPi appears to selectively inhibit 
activated transcription from a variety of inducible 
promoter-reporter gene constructs (Figure 6D) . Because 
mutant SCPI is competitive with SCPI, the stimulatory 
effects are consistent with partial inhibition of 
endogenous SCP. The stimulatory effects of the phosphatase- 
inactive mutant suggest that the Ser 5 dephosphorylation 
cycle is highly dynamic and likely carefully regulated. 
Based on the assumption that mutant SCPI is recruited to 
the promoter, overexpression would shift the equilibrium in 
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the direction of increased Ser 5 phosphorylation. This 
could in principle lead to an increase in the efficiency of 
cap formation and promoter clearance. The inhibitory 
effects of overexpressed SCPl are consistent with excessive 
dephosphorylation of the CTD, perhaps leading to a decrease 
in the efficiency of cap formation, promoter clearance 
and/or transcript elongation. Because of the close sequence 
homologies between SCPl. SCP2 and SCP3, mutant SCPl may 
interfere with all 3 family members. The basis for the 
apparent selectivity in inhibiting activated transcription 
is not clear. This could in principle result from an 
increased efficiency in the recruitment of SCPI to a given 
promoter and/or a difference in the rate limiting step 
during initiation or promoter clearance. 

10041] The present studies do not exclude the possibility 
that SCPl influences the state of phosphorylation of 
substrates other than the CTD nor do they address the 
function of related family members which may function in a 
variety of pathways including those located at the cell 
surface and nuclear membrane (46,47). SCPI co- 
immunoprecipitates with the nuclear LIM interactor of LIM 
homeodomain proteins (48) as well as with transcription 
factors such as the T3 receptor, Lhxl, Mespl and 01ig2 
indicating it is not a specific NLI-IP (data not shown) . 
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SCP2/0S4 was initially identified in a chromosomal region 
frequently anplified in sarcomas and brain tumors (34) . The 
ectopic expression of the Xenopus ortholog induced mesoderm 
marker expression and formation of a secondary dorsal axis. 
Mutation Of the DXDX(T/V) motif abrogated the latter' effect 
(49) , suggesting that phosphatase activity is essential for 
duplicate axis formation, m contrast SCP3 /HYA22 was 
initially identified in a genomic region homozygpusly 
deleted in a lung carcinoma cell line (35) . The level of 
expression of each SCP family .ember may thus affect gene 
transcription and have overlapping but also diverse 
biological effects. SCPl, 2 and 3 are widely expressed in 
human tissues ^ith highest expression observed in 
skeletal muscle and lowest expression in brain (33,34). 
t0042J SCPI and related Ser 5 phosphatases may thus 
complement FCPl to control the orderly cycle of 
phosphorylation and dephosphorylation of the CTD during the 
transcription cycle. An understanding of the precise role 
that specific CTD phosphatases play in the transcription 
cycle is dependent on defining the specific point in the 
cycle at which they act and the mechanism by which they are 
recruited to RNAP ii containing complexes. Given the 
importance of the CTD in mediating the synthesis and 
processing of the primary transcript, an understanding of 
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the dynamics of CTD phosphorylation is central to 
elucidating the mechanisms and regulation of gene 
e^qjression. 



Abbreviations 

t0043, CKII. casein Mnase II, CTD. C-te™i„aX detain of 
^ poXynera.e II, pcpi. tfiip associated CTD fihosphatase, 

nuclear LIM interacting factor, PH9P, para- , 
nitrophenylphoephate, PP2C. protein phosphatase 20, P-TBFb, 
positive transcription elongation factor b, RAP74, SHft 
pol^arase II associated protein of 74 kDa (larger 'subunlt 
of TFIIP) , RHRp, poljonerase, SCP, s„«ll cTD 

phosphatase . 

Brief description of the drawings 
[0044] Figure 1. Alignment of se<iaences surrounding the 
catalytic domain and relation of SCP to PCPI 

A. Sequence alignment and relationship of 3 small 
Phosphatases with FCPl. Bracket indicates the conserved 
Signature motif and * indicate critical Asp residues 
involved in phosphatase activity. Previous descriptive 
names and chromosome locations are indicated. Multiple 
alignments were done using clustal W algorithm with ^^ector 
NTI Suite (Informax) . 
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B. Domain structures of ^ PCPl and SCP proteins. 
t0045] Figure 2. SCPl Is a class C CTD phosphatase 

A. PH optimum for SCPl utilizing synthetic peptide 
substrates. GST-SCPl 214 (40 pmol, was incubated with'SO ^ 
PNeP for 60 min at 30»C and phosphatase activity measured 
by the change in A410. 

B. Divalent metal ion requirement for SCPl activity. The 
Phosphatase activity of GST-SCPI 214 (40 pmol, was measured 
in the presence of 20 mM PNGP and varying concentrations of 
[Mg^*] or tea-] . Activity was also measured in the presence 
of 1-10 okadaic acid and l-io m microcystin. The lOpM 
concentration is shown. Mutant SCPl (D96E D98N) was 
inactive (- - . SCP2 also exhibited phosphatase activity 
(□) . 

C. and D. CTD phosphatase assay of PCPl and SCPl on GST- 
CTDo and RNAP lio prepared by MAPK2/ERK2. Increasing 
amounts of FCPl or GST-SCPl 214 were assayed in the 
presence of 75 fmol GST-CTDo (lanes 1-6), 75 fmol RNAP lio 
(lanes 7-12), or 75 fmol GST-CTDo and 75 fmol RNAP Iio 
(lanes 13-18) . Both GST-CTDo and RNAP no substrates were 
prepared by the in vitro phospho^lation of CKll-labeled 
GST-CIDa or RNAP iia by MAPK2/ERK2 . All reactions were 
carried out in the presence of 7 pmol RAP74. CTO 
dephosphorylation of both GST-Cl^o and RNAP no is shown by 
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the increase in mobility of GST-CTDo to GST-CTDa and 
subunit llo to lla, respectively. The difference in the 
intensity of radiolabeled GST-c^oand that of radiolabeled 
subunit no is not a reflection of a difference in the 
amount of substrates present, but of the higher efficiency 
With Which CKII incorporates radiolabeled phosphates onto 
the most C-terminal. serine of GST-CTDa compared to subunit 
Ila. 

(00461 Figure 3. Substrate specificity of SCPl. 

A. Dephosphorylation of RNAP no prepared with various CTD 
kinases. Increasing amounts of GST-SCPl 214 were assayed in 
the presence of 3 . 7 fmol RNAP no prepared by CTDK1/CTDK2, 
TPnH, P-TEFb, MAPK2/ERK2 and Cdc2 kinase. All reactions 
were carried out in the presence of 7 pmol RAP74. CTD 
dephosphorylation of RNap no isozymes by GSTSCPl 214 is 
shown by an increase in mobility of subunit no to that of 
Ila. The results are summarized in the graph showing the 
percent of RNAP no remaining as a function of increasing 
SCPl concentrations. 

B. Bffecta Of GST-SCPl 214 on a 2S peptide consisting of 
heptad repeats containing either Ser s phosphate or Sar 2 
Phosphate. The indicated amounts of SCPl were incubated 
With the phosphopeptlde substrate and phosphate released 
was measured as described under "Experimental Procedures." 
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t0047] Figure 4. Effect of RAP74 on CTD phosphatase 
activity of SCPl and SCP2 . Increasing amounts of the 
indicated forms of SCPl and SCP2 were assayed in the 
presence of 14.4 fmol R^p ho prepared by tphh. Reactions ' 
were carried out in the presence and in the absence of 7 
pmol RAP74. 

t0048J Pisur, 5. «^=1.„ lo=.li,.tion «d .«oci,ticn of 
SCPl with RNAP II. 

A and B. cells were co-stained for the endosomal marker 
EEAl using mouse anti-EEAl and Alexa Fluor 594 conjugated 
goat anti-mouse (red) . Nuclei were detected with DAPI 
(blue) (panel A) . Immunofluorescence microscopy detection of 
endogenous SCPl using rabbit polyclonal IgG 6307 and Alexa 
Fluor 488 conjugated goat anti-rabbit IgG (green) (panel 
B) . second antibodies alone served as a control in all 
cases. 

C. Coimmunoprecipitation of RNAP li and endogenous SCPl. 

Extracts from untransfected C0S7 cells were 

immunoprecipitated using sepharose-immobilized anti-scPl 

IgG 6703 or sepharose-immobilized control IgG. 

immunoprecipitates were resolved on SDS-Page and blotted 
using anti-RNAP II antibody (8WG16) and with the Ser 2 
phosphate epitope specific antibody H5 and the Ser 5 
Phosphate epitope specific antibody H14. Rnap ho present 
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in COST lysates is shown on the left (5% load) and the 
relative ratio of each form of rnap ii in SCPI 
immunoprecipitates to that in lysates is given. The data 
shown is representative of three experiments. 
[0049] Figure 6. Effects of Phoaphatase-active and 
Phosphatase-inactive SCPI on gene expression. 
A. Effects of targeted SCPl 261 on reporter gene 
expression. Gal 4-DNA binding domain-SCPX fusion protein 
expression plasmids were cotransfected in HBK293 cells with 
a Gal 4-TKLuc reporter plasmid and luciferase gene 
expression quantitated. In all panels, results of 
triplicate transfections are shown +/- SD. Data is 
expressed as fold- activation experimental /control) . 

B. Effects of SCPI 261 and mutant SCPI 261 on basal 
promoter activity. The indicated reporter plasmids were 
cotransfected with SCPI or phosphatase-inactive SCPI. 

C. Differing effects of SCPI 261 and phosphatase-inactive 
SCPI 261 on Gal 4-VP16 stimulated gene expression. The 
indicated amounts of SCPI or mutant SCPI expression 
plasmids were cotransfected in HEK293 cells along with Gal 
4-.VP16 expression and Gal 4-TK-Luc reporter plasmids and 
luciferase activity was quantitated. 

D. Effects of SCPI 261 and phosphatase-inactive SCPI 261 on 
ligand activated receptor activity. The indicated receptor 
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expression plaeoUda were cotrax^afeoted wxth their cognate 
pronK>ter-reporter plaamida with or without SCPl or .Mtant 
SOPl expression plaa.dds. Cells were treated or untreated 
with receptor-specific ligands as described under 

ExperimentalProcedures • 

B. Competitive effects of ,„,tant SCPl 2S1 with SCPl 
The indicated concentrations of SCPl and mutant SCPl 
expression plasmida were cotransfected with LMXx and B47 
expression plas,.ids and the rat insulin i pronoter- 
luciferase reporter gene, l^ciferase activity was „.asured 
as described. 

SCPl nucleic acid sequence on chr«aoaome 2 

SSS^S ™agt cc.ccta.cc .c.cc^,.. 

gggagccttg aaaclgclcc fggS?ccat T^^J'^T^ aagccgttgc cctttlLgg 
catgttgtaa caaagttLc t?gcgccco ctcc?tSo^ g^ctcgcggg aaggaaae?! 
cccctcccct ccggagctcg cggggatccc tcon^^nf gaacctggct 
cccgattccg gccccagccg giSSaoSc "° ccctcccctc ccccccgcgc 

ggagcgcgcc cggcccLtl Sc^Sgg ??"T^*== cggccg|a|c 

ctcggggccc gctgcggggc aaag|tacli galctoeaa^ ^cagatcagc aaggaggagg 
gccgtgggag gagagaaggg gcc|igatcf gagggggccg aagccggggc 

gcggccgcct tagctgtill cgaffctccc ^occcoff^^ agccgccgcc gccgcccclg 
actcagagga ggctcagaga cicglggcgq Scctloo^^ ggagcaggga gagagttt|I 
ctcgaggtga ggaaactgig gLSfS Saaol!^^^ ^tttggggcg ctcctgtclg 
ggcattgcgt ggtgeat|g| ?gc?cccccf c<=t:tcggggg tttcctggca 

ggagctgagg agg|cgcc?a Sggcoacc? acta!f^*'?° caatggggct gtgagajjgg 
cccccccggg ctgcggtccg gtlggg^c^t ^aaf^f^ ° cccacccoca 
gggaggcttg gagggatctc ccg^^I^c^c !cSf?I?It f^^^^^^tga cagcaggctg 
cgcgtggagg cgccgacccc ctcggaggca cSfoaaaf^ ^^*===^<^=^a cttcgcgtca 
gcttggcgcc cgcggggaga gtcjlgcfcc taSfo 9g<=«ggcact tccaagagtc 
gccgccccga cgaggctgcg ?ccfc!agcg S?ct?aa^^ gcaccacccc gcaaagcctc 
ctccttttcc ccgtgtggac ctcagga^c? Scaf?!^ Sgggtggggg ggtctgtcbt 
cctgggtctg gctgccccgg aactSgggc aSSaSf^ cccaggtctg cccaccctcg 
aggggggtgg ggtgggaggg gtatct£?| atSScSc T'^'' agggggccg| 

tgggcggggo agggcaaaca gatggccact SSeflhS^ ^^^S^^tccag gtcggaggtc 
ccctgcctct gggcccagcc gcallgSga ??tSh!^? '='=*=^gg<^<=g° gggactgcac 
gggagggcag gggtggactg ccc?ggg??S caaaccSf ^l^l^'^^^^^ agaggatgga 
caggtaaggt ggggtcagga ggca??£aa SggSa? ^2 ^aggggtgct 
gtgagaggca gggagagagc accccalgac c?Sc%' Sf^Sc^gS T^^, 
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gggcgcctta atacctgcta aacctafct-hrr 

tgtggagccc tgacag^ggc Itttc^gagf llattltll^ Scaggagcct tggagttgat 
gaaggggcca cggcaagatc ctcctggccc aafo^^P^ ctgccttcgt gtgtctggat 
ttctgcaggc cagtgtgcac cgagcSIc! ac??S^°^^ acctgggcac acatgcJgga 
tSaof"^° "cccagaag cfc^ajgcc S|2a?c2 "r^^""" 9StgLcS: 
tctgccggga tgatggggag gccctgcctg c?Scaoo^^ ccactcactc ttctgctgEg 
agaatggcgc catccctaag gtgcgtgggi acca«o?^^ Sgcgcccctg cttgtggagi 
cagtcttcag ggotttaggg laagSl??? ctS!??^^ 9°*=?cggggg cacItSa?? 
acctgaaagt gcagagtagg aggg?ISg cScccSo^ ttttcaggat ggactffcag 
gaaactgaat tctccctcat aagtgglaSc tt^^^^^K^ caggccctgc ccactg^ggf 
caaagagcca agaggcctac ccLScSa aa^^^ cttggttttt agagajg??? 

aagtatctgt tcctggggcc tg^Stcc? Sao!^^°^^ gggcaaagct g|gaa|ggg; 
gggaggaagg aocaggcccg gi2gagg« ccS^r'^^^ «=tagggggag a5cc?|?|? 
accccagtcc aatacctgc? loftgagfcc tlaaatltl^ ccocgccctc cctacagclg 
gtcatcgacc tggacgagao cctgSgfao ap??ce^^^^ a^tcagacaa gatctglgtg 
ccctcagccc gggtctcggg gggcatlccc Sa^!o^ ^ aggtgggccc tgctcaacal 
gaccccatgc cctggggctc ^fLtccaac "caocfa^n ^fff^^^^^S gtgtgtgct? 
gaacaacgcg gacttcatca tccctgtaaa oft^f!f^ cttttccccc cacagccagf 
caggaagagg cagtggtggg cttggSfot Itnl^^^^ gtggtccacc aggtgaggic 
ccggagcgcc tcggatgliJ atglataca ^otoS^f^ cctaggctct tIccLSIt 
gagacttgtc ccaaagtcac acaSaaccho ^9*'99^^*=9t cagaggccca gagagggtat 
gtgggctcct cctctfggct clTcTtTol g?£?ccc?c ag?c?g3; 
cagttcaagt aattcaggat aggttgtqta tllt^t fccccaccct gcccgggacc 
ggggaccggt gccctgcagc cttaaaa^«f *'*=9'="=«=agcc tgttctccat tacttgqcte 
tgaggttgag gcagg^galg Tslttstll ttllTaTt tgcac?Sgc 
tggagtggaa gttttgatcg ttttchof^« ttagggacct ogtgaggtag gactgqccL 
cagggtagct ggccalgcgl agccjggf? t^^t^f^'' acagtttccc cagSSgJ? 
cctggcccat gccctgccf| J^tcggS? cccScS "'=^'=9^9-9 cc?ctgc??c 
cctcactggc ccgcccccca ggtctaLto gccccactgg cccgcagccc 

cagcgaatgg gcgagctctt fgaaStSf cSttf^^^ ctcatgtggi tgal???ctg 
cccaoagggg tcccggggca accctgccc? cltlln^^^ ctagcctcgc caaggtgagc 
acctgcgggc cccaggatga cccac^tcct gSccSota '^"^^^^^^^ agcIcaJtS 
tgctggacaa atggggggcc ttccgggccc alltlt^fl cgcagaccca gtagctgacc 
gggggaacta cgtgaaggac ctgagcgS f agagtcctgc gtcttccacc 
tggacaattc acctgcctcc ta?g?ct?S aSSafn^ r*=^*=^^^95 9tgctcatcc 
ctgggactgg gacaggagct gaaacce^f^ atccagacaa tgctgtgagt gcgggctaaa 
ctcttggatc cccagftfgg S^gggS? octcaal't^ tccattcagg LfSSlS 
tgcctgccgc ccactccco? Ia?cfalc?a =ttcctgcat tcattgcrtj 

acaaagcaga gcatctgcca tgcacaaaal ^^°'=9^^9°° atatggtctt ttcccctcgf 
tcaatttttc gaattg?cag Sgtga???! gcaacggagt ttggaaag?t 

tctgattggg aattttaaci aalSaaa^a o«^?^*'°° acagaattaa ttlagtgggt 
ctcatgcctg taatcccagc actSggSaJ lc?Jafo^^^ tatccggtog ggtgfag?g| 
gttcaagacc agcctcggca acatafSaa fccft^f gcaggtagct gagccclg?a 
gccaggcgtg gtggcgcatg cctgglScc ooocfc^Jf '' tacaaaaaat acaaaaa?ta 
S^ocaa^"" -tggaggcag aggf?g?gj? gfSSat gf^^^^^^ag gtaggagtat 
tgggcaacag agcaagaccc tgcctcaaaa f !f ° ^ *^ tgtgccactg cactctagcc 
gataaggtta ggaattgtga a|o?t"g« gtatccaagt gcttcgScI 

attgctgtca aaaaagtttg aacactgtao oJf«^ ^! ataaatgtgt tttcctgggg 
?cSS?^°' -catagggc? ggcctSa!! tlalttlfH "cagaaact gcatga???| 
tcttcctaca cccacttccc caaataao«« "°'^9fa=<=c aggaccacct gcccccotea 
cagagctcca cgacctcoto ccl?tcttlf ^ggcctcgtg gtttgacaac atgagtSacJ 
oagtgctcag gcagccacgg ccagJqaac? ^^^^^^•='=^9 ccgtgtggac ga?g?g?ac? 
tgaccaabga tacccacacc ?cc?cccaaa Itt^^f^^^^ atggggccag laccgcJcc 
catgggocgo cgocacactc ag?gccat?a 00!^^°°° aggcctttgt taggalaacc 

A d i i-F'S s:-— :-b:s 

KSg- Ki^ss 
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ipSSS cMg IZ^ttll- ircT^iiii — 

aaggcagcag gagtctgtcc ttca^cccc a!2?SaS ??ff' ^^"'^ accactggg| 
agggctgctc cctgcatcat ccaaacaa^« ? ^ ^ ttctccaggg ccgacagctg 
99cttggctc ccccagctcJ gagStggg| gSagSaa ^f^'^^*^'^-- ccagccScgl 
ataaatatgt acatgtgtat atagat???? !aaaa!!«^f Sacccccctt gtggtgccat 
gagacacccc tcccttgccc ctt?ccSgg c?cSaaa?t SStcagggta 
tttttacatt ttttaaactg ctattttcta 933gggaggg agggaaagga 

ccctgtcctc tgtccctca? ac^cct^tg? ltl?ttllll 3g9gcccSg 
gagcaccttc tgtgcccagc atatgctala llntrZ t ^^^tcaaaaa aacatttctt 
tacgccagct catcagtgcc tcct?gcc?! tcc?tcacc« "^tgtgtgt ggggggtctc 
ggaggtggga ccttctgtgg ggtttSgga tctccSaof! ^*^9°°"tgg gggatctgta 
tcccctgtgc caacccatct IltacaSfc aclallff.tf S^^^^Sf^^c^a 3ctgtcccct 
ctcccaggat atcctgcctt ccaac?It?t ctof! ^ cccctgotgg ctgggggcag 
eggaggtcaa ggccttgggc tctcccSgg gSaacgpt ^0°°*^"^° atggcgatt? 
tgccaagggg gatgtcaagt ggtgatgtcg tlllllt^^l taaggggacc cacataccag 
ggggggtgaa tatggttggc c?goatSgg ^gSft^ccc attt""":^^ 3cgggtgggc 
actgagagcc ctagtgtgat gagaactaaa oloff! atttaagtgc cttctctgtg 

ggttattgcg gggga?t?ca IcSgtg^g ^SgScct^ itHT^^'' ctgcttctg? 
cccccccccc gcttcagcac acctaglSg c?2Sat«o f'=^=°'^9«=93 ctgccgtggg 
tcgacacttc aggtgggaag aqcaacahnf ^^^.^^^^^^3 agggaggggc tgcccggccc 
cgtctacgtc t^ctHalt ggg^It^gg lllTcl^l "gagcctcc aggct^Jgct 
ggactaatta agtgcctgga laLga^gac aggItac?S ^laff^^^^ cctattggca 
aaggcctggg taggggcagt cctaaaa^a^ 7.^7. ^ tagctggaaa gcaaaathcg 

gagcagaggg gcgS^ct^g ^gcJIc^ccg ISfS^f 3333cgtggc ctctgccctg 
tgggtggagg ctccagcggc aLIItt^? alnf^tTf^ cggggagcgg ggcgaggctc 
ctctgccggg cggcctggt g^gcSSa ccaS^^f ^ ^ggcgggctc cggcgcctgg 
gcctctgagc agccafctSc l^gt^cla ccfI!^^f°^ gccccccccg cccctctgcg 

,.c«c.,... «c„4^ ^Jg~-== e«c«c.J 
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What is claimed is 

1. An isolated nucleic acid molecule selected from the 
group consisting of : 

a) the cDNA deposited with ATrn 

a-ucui wicn Arcc as Accession Number 

BE300370; 

b) the CDNA deposited with ATCC as Accession Number 
AL520011; and 



c) the CDNA deposited with ATCC as Accession 
M63, 

or a complement thereof. 



2. The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. ' 

3. The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 
polypeptide . 



4. A host cell which contains the nucleic acid molecule 
Of claim 1. 



5. The host cell of claim 4 which is a mammalian tfost 
cell. 
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6. A non-human mananalian host cell containing the nucleic 
acid molecule of claim 1. 

7. An isolated polypeptide selected from the group 
consisting of : 

a) the polypeptide encoded by the cDNA insert 
deposited with ATCC as Accession Number BE300370; 

b) the polypeptide encoded by the cDNA insert 
deposited with ATCC as Accession Number AL520011; and 

c) the polypeptide encoded by the cDNA insert 
deposited with ATCC as Accession Number AL520463. 

8. The polypeptide of claim 7 further comprising 
heterologous amino acid sequences. 

9. The polypeptide of claim 7, wherein the polypeptide is 
a phosphatase. 

10. The polypeptide of claim 9, wherein the phosphatase is 
a small C- terminal domain phosphatase (SCP) that 
dephosphorylates RNA polymerase II. 

11. The polypeptide of claim 10, wherein the phosphatase 
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is small C- terminal phosphatase-1 (SCPl) , small C- terminal 
phosphatase-2 (SCP2) , or splice variant thereof, 

12. The polypeptide of claim 9, wherein the phosphatase 
dephosphorylates serine 5 within the C-terminal domain of 
RNA polymnerase II. 

13. An antibody which selectively binds to a polypeptide 
of claim 7. 

14. A method to increase gene e^qpression by administering 
a nucleic acid encoding a phosphatase inactive mutant of 
SCP or its gene product. 

15. The method of claim 14, wherein administrating a 
nucleic acid encoding a phosphatase inactive mutant of SCP 
or its gene product upregulates gene transcription by RNA 
polymerase II. 

! 

16. A method to decr;ease gene expression by enhancing the 
expression of wildtype SCP, or administering a nucleic acid 
encoding wildtype SCP or phosphatase active mutants of SCP 
and their gene products. 
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17. The method of claim 16, wherein enhancing the ^ 
expression of wildtype SCP, or administering a nucleic acid 
encoding wildtype SCP or phosphatase active mutants of SCP 

and their gene products down-regulates gene transcription 
by RNA polymerase II. 

18. A method of modulating the activity of a polypeptide 
of claim 7 comprising contacting the polypeptide or a cell 
expressing the polypeptide with a compound which binds to 
the polypeptide in a sufficient concentration to modulate 
the activxty of the polypeptide. 

19. A method for identifying a compound which modulates 
the activity of a polypeptide of claim 7 comprising: 

a) contacting a polypeptide of claim 8 with a test 
compoxmd; and 

b) determining the effect of the test compound on 
the activity of the polypeptide to thereby identify a 
compound which modulates the activity of- the polypeptide. 
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f 

Abstract 

[00501 The transcription and processing of pre-mRNA in 
eukairyotic cells are regulated in part by reversible 
phosphorylation of the C-terniinal domain of the largest RNA 
polymerase II siibunit. The CTD phosphatase, FCPl, catalyzes 
the dephosphorylation of RNAP II and is thought to play a 
major role in polymerase recycling. This study describes a 
\family of small CTD phosphatases (SCP) that preferentially 
catalyze the dephosphorylation of Ser 5 within the 
consensus repeat. The preferred substrate for SCPl is RNAP 
II phosphorylated by TFIIH. Like FCPl, the activity of SCPl 
is enhanced by the RAP74 sxibunit of TFIIP. Expression of 
SCPl inhibits activated transcription from a number of 
promoters whereas a phosphatase- inactive mutant of SCPl 
enhances transcription. Accordingly, SCPl may play a role 
in the regulation of gene expression, possibly by 
controlling the transition from initiation/capping to 
processive transcript elongation. 

10261412. doc 
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